Effect of Speech Compression on the Automatic Recognition of Emotions
نویسنده
چکیده
This paper investigates the effects of standard speech compression techniques on the accuracy of automatic emotion recognition. Effects of Adaptive Multi-Rates (AMR), Adaptive Multi-Rate Wideband (AMR-WB) and Extended Adaptive Multi-Rate Wideband (AMR-WB+) speech codecs were compared against emotion recognition from uncompressed speech. The recognition methods included techniques based on three different types of acoustic speech parameters: Teage Energy Operator features (TEO), Mel Frequency Cepstral Coefficients (MFCCs), and Glottal Time and Frequency domain features (GP-T&GP-F). The results showed that in general, all three speech compression techniques resulted in the reduction of emotion recognition accuracy. However, the amount of degradation varied across compression methods and types of acoustic features. It was observed that the accuracy of emotion recognition using the AMR-WB technique was higher than the accuracy of the AMR-WB+ and the AMR codecs. Further, the TEO-PWP features showed much more robust performance under different compression rates than the MFCC, GP-T and GP-F features. speech classification
منابع مشابه
A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
متن کاملEffects of ageing on speed and temporal resolution of speech stimuli in older adults
Background: According to previous studies, most of the speech recognition disorders in older adults are the results of deficits in audibility and auditory temporal resolution. In this paper, the effect of ageing on timecompressed speech and auditory temporal resolution by word recognition in continuous and interrupted noise was studied. Methods: A time-compressed speech test (TCST) w...
متن کاملThe Effects of Culture and Gender on the Recognition of Emotional Speech: Evidence from Persian Speakers Living in a Collectivist Society
This paper reports on a behavioral study that explores the role of culture and gender in the recognition of emotional speech in an under investigated cultural context (a collectivist society: i.e., Iran). Participants were asked to recognize the emotional prosody of a set of validated emotional vocal portrayals (including the five basic emotions). Findings of the experiment were then comp...
متن کاملSpeech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions
Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...
متن کاملDesigning and implementing a system for Automatic recognition of Persian letters by Lip-reading using image processing methods
For many years, speech has been the most natural and efficient means of information exchange for human beings. With the advancement of technology and the prevalence of computer usage, the design and production of speech recognition systems have been considered by researchers. Among this, lip-reading techniques encountered with many challenges for speech recognition, that one of the challenges b...
متن کامل